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Abstract. Herein, we introduce and study a new class of discrete random fields designed for quick simula- 
tion and covariance inference under inhomogeneous condition. Simulation of these correlated fields can be 
done in a single pass instead of relying on multi-pass convergent methods like the Gibbs Sampler or other 
Markov Chain Monte Carlo methods. The fields are constructed directly from specified marginal proba- 
bility mass functions and covarianccs between nearby sites. The proposition on which the construction is 
based establishes when and how it is possible to simplify the conditional probabilities of each site given the 
other sites in a manner that makes simulation quite feasible yet maintains desired marginal probabilities 
and covariances between sites. Special cases of these correlated fields have been deployed successfully in 
data authentication, object detection and image generation. The limitations that must be imposed on 
the covariances and marginal probabilities in order for the algorithm to work are studied. What's more, 
a necessary and sufficient condition that guarantees the permutation property of correlated random fields 
are investigated. In particular, Markov random fields as a subclass of correlated random fields are derived 
by a general and natural condition. Consequently, a direct and flexible single pass algorithm for simulating 
Markov random fields follows. 

1. Introduction 

Random fields are widely used in sciences and technologies to model spatially distributed random phe- 
nomena or objects. Within sciences, random fields are used in geophysics, astrophysics, statistical mechan- 
ics, underwater acoustics, structural biology and agriculture. Applications of random fields in technologies 
include TV signal processing, image processing in photography such as medical images (human brain 
imaging, functional magnetic resonance imaging, mammography), computer vision, web data extraction, 
clustering gene expression time series, natural language processing etc. Readers are referred to pQ, [S], 
[12) . [15] . [15] . [16], [17], and [18] for those applications. Technologically, researchers of random fields have 
dealt either with the modeling of images (for synthesis, recognition or compression purposes) or with the 
resolution of various spatial inverse problems (image restoration and reconstruction, deblurring, classifi- 
cation, segmentation, data fusion, optical flow estimation, optical character recognition, stereo matching, 
finger print classification, pattern recognition, face recognition, intelligent video surveillance, sparse signal 
recovery, natural language processing like Chinese chunk and so on, see [4], [6], [13], [14], and [To]). 

Scientists and technicians are interested in the inverse problems such as image restoration, boundary 
detection, tomographic reconstruction, shape detection from shading, and motion analysis. Many precisely 
formulated mathematical models were constructed to model certain types of random fields, and various 
methods and estimators have been developed to make the proposed models work in application. There 
are diverse needs calling for simulating random fields. For example, simulation is employed to calculate 
minimum mean square (MMS) and maximum posterior marginal (MPM) estimators, see [15] . Simulation 
can also be a potential smoothing technique. In the chapter 2 of Winkler [15] , various smoothing techniques 
were proposed to clean "dirty" pictures. Most of these methods involve simulation. The difficult problem 
is how do we effectively simulate random fields. A typical simulation would involve 100,000 or more highly 
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correlated random variables, which would certainly exceed the capacity of modern computers if one tried 
to simulate the whole random field directly. 

Researchers frequently resort to imposing discrete Markov assumptions on their random fields to be 
simulated out of practical need. In this regard, the Gibbs sampler was proposed to ease this simulation 
difficulty. Briefly speaking, a Gibbs sampler starts with a given initial configuration (i.e. potential real- 
ization of the random field) or a configuration chosen at random from some initial distribution, and then 
updates its configuration site by site based on the local characteristics of the random field. Once all sites 
of a configuration are sequentially updated, a sweep or a pass is finished. A Gibbs sampler usually takes 
hundreds of sweeps to produce a configuration closely consistent with a given distribution and there are 
still computational and convergence issues to deal with. 

In this paper, we propose a new class of discrete correlated random fields which incorporate given 
probability mass functions (pmfs) {7r Si } for all sites S — {si}fL 1 and given covariances /3 Si)Sj between 
nearby sites. These fields are dsigned with efficient simulation in mind. The number of possible random 
configurations within a general discrete random field can be enormous and simulation is further compli- 
cated when the sites are correlated with one another. These factors can make Gibbs sampling and other 
Markov chain Monte Carlo simulation impractical. However, Proposition 1 2 . II on which our fields are based 
establishes a method to imbed desired covariances and marginal probabilities into a random field while 
maintaining simulation ease. Indeed, Proposition 12.11 is a simple means to construct some site-by-site 
conditional probabilities consistent with given marginal probabilities and site-to-site covariances in such a 
way that sampling the missing portion of a random field sequentially is very feasible. More precisely, when 
simulating a new site, we compute this conditional probability mass function of its state conditioned on the 
known portion and the previously-simulated sites. This construction establishes the sequential simulation 
property of our correlated random field, that is to say, we can actually construct a random field in one pass 
based on this algorithm, reducing the computation over the Gibbs sampler dramatically. For demonstration 
purposes, we discuss application of our random fields and simulation algorithm to Data Authentication, 
Object Detection and Image Creation. 

The constraints and properties of the random fields generated by Proposition 12 . 1 1 are discussed in detail. 
The necessary and sufficient conditions of the regularity type are given in Proposition 15.11 when base set 
is a singleton. In particular, we investigate the conditions related to the marginality and permutation 
properties of our random fields. In the case where one wants to match the covariances between each pair 
of sites for S = {si}" =1 , the field takes the form (See Lemma IBTTj) 
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where {7r Si } and {7r Si } are two auxiliary collections of pmfs, and we study conditions on 7r S; , n Si , 7r Si and 
P Si ,sj that ensure the same random field X$ is constructed, regardless of site ordering. In addition, if 
■K Si = TT Si VI < i < n, then each pair of uncorrelated sites are actually independent. These results are 
given in Corllarv 15.21 Proposition 15.31 and Corllarv 15.31 The predominance followed from these results is 
to simulate true Markov random fields rather than just correlated random fields on a general site 
space (5,9). For a given (S,d), its neighborhood system d can be extended to d' (under d' all sites in 
S are neighbors of each other) by setting those pair of sites which are not neighbors of each other have 
covariances 0, i.e., for each s € S, if any t ^ d(s), let f3 s , t — 0. Herein, we develop a direct and flexible 
algorithm from Proposition 15.31 to generate site-order invariant Markov random fields, i.e., the simulated 
Markov random field does not depend on the particular site order generating it. This subclass of random 
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fields require stronger assumptions and have nicer properties, compared to the correlated random fields 
generated by Proposition 12. II 

The remainder of this note is laid out as follows: Section [2] contains our notation and the statement 
of our main results, Proposition 12.11 Next, we explain our simulation algorithm in Section [3] In section 
IH we summarize our prior applications of our simulation algorithm to image analysis. In Section [5j we 
explore its properties and constraints. We give the necessary and sufficient conditions of the regularity 
type for singleton case of A Si in Proposition 15. II We also give a necessary and sufficient condition for the 
permutation property of a random field in Proposition 15.21 and 15.31 An effective algorithm for generating 
Markov random field follows from these propositions. Section [6j the Appendix, contains our proofs of 
Lemma 12.11 and of Proposition 12.11 

2. Notation and Background 

In this section, we recall some random field notation from Winkler |15) . introduce new concepts, and 
state a proposition from which our novel random fields and algorithm follow. Let S be a finite index set 
of sites; and for each site s £ S, X s be a finite space of states at site s. For nonempty A C S, denote 
the space of configurations xa — {x s )seA on A by Xa = Y\ s ^a H A = S, we abbreviate Xg by X , i.e., 

* II. 

Let LI denote a probability measure or distribution on X. If for every x G X, H(x) > 0, i.e., II is 
a strictly positive probability measure on X, then II is called a random field. We also call the random 
vector X — (X s ) s& s on the probability space (X, II) a random field. For a nonempty A C S, define the 
projection map from X onto Xa as follows: 

X A : x -> x A , 

where x £ X and xa € Xa- 

The definitions of neighborhood system and Markov random field follow directly from Winkler |15j : 

Definition 2.1. A neighborhood system d = {d(s) : s £ S} of S is any collection of subsets of S that 
satisfies the conditions: (i) s ^ <9(s) for every s £ S and (ii) s £ d(t) if and only if t £ d(s). The sites 
t £ d(s) are called neighbors of s. 

Definition 2.2. The random field II is a Markov random field with respect to the neighborhood system 
d if for all x £ X, 

H{X S = x s \X t = Xu t^s) = U(X S = x s \X t = x t ,t £ d(s)). 



We will introduce the notions of base set and one pass simulatable respectively. These concepts 
rely on the following definitions and discussion. First, the neighborhood of nonempty set generalizes the 
neighborhood of single site: 

Definition 2.3. The neighborhood of nonempty A C S is d(A) — I d(s) \ \A, that is the neighbors 

of the sites in A that are not part of A themselves. For convenience, let 9(0) = S, i.e., the neighborhood 
of empty set is all sites S. 

Next, we define exclusiveness, connectedness and separateness for subsets of site space S based 
on the neighborhood of nonempty set: 

Definition 2.4. Two subsets -61,-62 C S are exclusive from each other if [-61 (J d(Bi)] f] B2 = (or 
Bi f][B2 |J ^(.62)] = 0). A subset B C S is connected, if for any nonempty proper subset A c B, 
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d(A) p| B 7^ (If a subset B C S contains only one site or is empty, B is connected, since 9(0) — S). A 
subset B C 5 is separated if it is not connected. 

In the above definition, [B\ [J d(Bi)} f] B^ = and B\ f][B2 U <9(i?2)] = are equivalent, i.e., one implies 
the other. Separateness of a nonempty subset B implies that there exist two subsets £?i,i?2 which are 
exclusive such that B = B\ (J B^- 

In some applications, e.g. image restoration and shape detection from shading, a site space S is divided 
into two parts: good part and bad part, or known part and unknown part. Herein, we let H C S denote 
the bad or unknown part, and H = S \ H the good or known part of the site space S. For a nonempty 
subset H of a connected space (S,d), we can order its sites sequentially and associate each site with a 
connected set, which is a subset of the neighborhood of the site: 

Lemma 2.1. Assume that (S, d) is a connected space of N > 2 sites and H C S a subset of n > 1 
sites. Then, the sites in H can be ordered as a sequence {si}™ =1 such that Si <E d(H c lj{si, Si_i}) 
for 1 < i < n. In addition, there exist unique mi > 1 and connected subsets {B J Si }"li 1 (called connected 

mi 

components) such that <9(si) f^|[-ff C [_J{ s ii s i-i}] = U Bf. and {-B^}™^ are exclusive from each other. 

i=i 

We choose one component among {-B^}™^ and denote it by A Si (except A Sl = if H = S) for 1 < i < n. 

Lemma [2TT1 is proved in Section^ Neither the site order nor the connected subsets {A Si }™ =1 are unique. 
Rather, we just assume henceforth that a particular setup has been chosen. 

Definition 2.5. Suppose that (S, d) is a connected space of N > 2 sites and H C S & subset of 1 < n < N 
sites. Then, a valid setup for H is an ordered collection {(si, A Si )}" =1 such that H — Ur=i{ s «}> s i e 
d{H c lj{si, Sj-i}) for 1 < i < n and A Si C 9(si) P|[if c lj{si, Si_i}] is one connected component of 
9(si) f][H c lj{si, Si-i}] for 1 < i < n. The set A Si is called the base set for site Si. 

Example 2.1. When H = S, a valid setup for H is an ordered collection {(si, A Si )} 1 jL 1 such that A Sl = 0, 
Si G 9({si, Si-i}) and A Si being a connected component of <9(si) n{ s ii •••) s i-i} f° r 2 < i < N. 
We are really interested in using a valid setup to simulate the unknown portion of a random field. 

Definition 2.6. The unknown states H of a discrete random field II are one pass or sequentially simulatable 
with valid setup {(si, A Si )}l l =1 if 

IL{X Si =x Si \X Si _ 1 = x Si _^, X S1 =x Sl ,X H c = x H c) = U(X Sz =x Si \X As . = x As .)- 

Then, by the multiplication rule one has that for any configuration x £ X is given: 

n 

TL(X H = x H \X H c = x H c) = JJn(A Si = x Si \X Si _ 1 = x Si _ l: X Sl = x Sl , X H c = x H c) 

i=l 
n 

= Y[U(X Si =x Si \X As .=XA H ). (2.1) 
i=i 

From this configuration probability, we can find the probability of any set of sites B. When B C {sx, Sj} 
for some j, we need only compute the product of the first j terms 

3 

n(Ag = xb\Xjjc — xhc) — J2 II 11 ^ =*s k \X Aak =x A . h ), (2.2) 

s k <£Bk=l 

and this formula can be used to compute H(Xa s . = xa s . \X h c = Xjjc) in (|2.3I) below. 

Kolmogorov's consistency conditions consist of permutation condition and marginality condition. These 
conditions were applied to Kolmogorov's extension theorem to gurantee the unique existence (almost surely) 
of stochastic process that can degenerate to given families of random vectors. These conditions also light 
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up those good properties that Markov random fields satisfy. For each integer 1 < n < N, let M n contain 
n elements and Gm„ be the symmetric group consisting of all permutations on M n . Then we define the 
consistency conditions for discrete random field as follows: 

Definition 2.7. Assume that (S, d) is a connected space of N > 2 sites, H is a subset of n > 1 sites with 
valid setup {(s^, A Si )}" =1 and X H c be the known portion of the random field. For each 1 < k < n, and 
each Mk = {ii, ifc} C {1, • • • , n} and g £ G]\i k , let H Sg(il - ) ,...,s g ^ 5 ('\Xhg = x H c) (if it can be defined the 
same way as U Siit ,,, tSih (-\X H c = x H c)) be a probability mass function on X Sg(n)i ... iSg( . fc) = n«=i X SgM ■ 
The two consistency conditions for the collection of probability mass functions {H Sg{iih ... tSg{ik) (-\X H c = 
x h c ) '■ 9 € Gu k ,Mk — {hi — >*fe} ^ {!)"*' j"}: 1 < fc < n} are listed as follows. 

(1) permutation: for each permutation g £ G^, = {ii, ik} C {1, • • • , n} (1 < k < n), and 
x Si £ X Siu (1 < u < fc), 

n Sg(n) ,..., S9(lfc) (i» B(<l) , -, x Sg(ifc) |X H c = x H c) = U Sii r .. >Sik (x Sil , x Stk \X H c =x H c); 

(2) marginality: for each permutation g £ Gi\j k , Mk = {ii,...,ik} Q {1, • • • , n} (2 < k < n) and 

1 < 3 < k, 

x 3 6X, . 

= R Sg ( il) ,...,s gli ._ l) ,s g(l . +l) ,...,8 g(ik) (x Sg ( H) , ••■> :z; s g( , j ._ l) ,a; Sa( ^ +i) , -,x Sg ( ik ilXffQ = x H c), 

where x Sg(iu) € X Ss(iii) (1 < u ^ j < k). 

Note that in the above definition, it can happen that some n Sg( . i)! ... !Sg( . fc) {-\Xfjc = xjjc) are not defined. 
For example, for g on M n — {1, • • • the sites in H can be ordered by g as {s g (i)}™ =1 , but it may fail 

that Sg^i) £ d(H c lj{ s s(i)i ' ' ' > s g{i-i)}) f° r some 1 < i < n. In such case, we do not have II Sg(1) s . . 

defined. But if n Sg(1) ,..., Sg(n ) is defined indeed, then the permutation property states that the random fields 
on H generated by the order X Sl , X Sn and the order X Sg{1} , X Sg{n) are the same. So the condition 
for permutation property assures that Xh on H can be simulated in any site order, given X^ can be 
generated in such order. Note also permutation property is not required or recommended for some type 
of random fields such as discrete time series. The marginality condition ensures that random field Xh 1 
generated directly on a proper subset Hi C H is the same as that degenerated from Xh- 

When simulating X Si , one has access to X H c lj{^si, ■■■,X Si _ 1 }, the known sites and the sites already 
simulated, but need only use Xa s .. This reduction is the key that makes one-pass sequential simulation 
effective when the base sets are relatively small compared to H. We will explain how this can be done when 
one is just interested in simulating a field with given marginal probabilities and certain covariances. For 
the case H — S, since H c = 0, the conditioning X H c = x^c disappears from (|2.1[) and (12.21) . leading to 
an easier-to-assimulate means of constructing a random field. Conversely, the case H C S is advantageous 
for real applications. 

Herein, we simulate random fields with given marginal probabilities for sites and given covariances 
between nearby (meaning within the base sets however they are defined) sites for a subset if of a connected 
space (S, d) with N sites. We assume a valid setup {(s^, ^4 Si )}?=i for H . Our algorithm constructs Xh with 
the given marginal probabilities {Tt Si (x Si ) : x Si £ X Si }" =1 and the given covariances between nearby sites 
{P Si ,ti ■ ti £ A Si } for 1 < i < n. (It is assumed a priori that these marginal conditions hold within the used 
set of known sites (U™ =1 A Si )\H. It is also assumed that /8 a<1 t 4 = A 4l a< f° r U € A Si (1 < i < n) since (3 Si ,ti will 
denote covariance between X Si and X ti ). We assign conditional probabilities Tl(X Si — x Si \ Xa s . — xa s .), 
such that we maintain the desired covariances and marginal probabilities as we include the unknown sites: 
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Proposition 2.1. Assume that (S,d) is a connected space of N > 2 sites, H is a subset of n > 1 
sites with valid setup {(sj, and Aye is the known portion of the random field. Suppose further 

that {tt s (x s ) : a: s G X s ,s G and {7r s (x s ) : x s € X s ,s G S 1 } are two sets of pmfs. Assume that 
{7r s (a; s ) : x s G X s ,s G S} are positive pmfs and {0 Si ,u '■ U € A Si , 1 < i < n} are numbers such that the 
right hand side (RHS) of (|2.3|) is in [0,1] for all i. Form the conditional probabilities starting with i = 1 
recursively as 



for each x Si G X Si and x^ s . G ~X-a s . (1 < i < n), where /z# a = tt s (x s )x s and cr| s = tt s (x s )( 



M-jfa) ( s S 5). Then, there is a probability measure II on X(u?_ A 3 )uh consistent with p.3| that has 
marginal probabilities {7r Si } and covariances cov(X Si , X ti ) — @ Si ,ti for all ti € A Sil 1 < i < n. 



Remark 2.1. Proposition ^ . 1 1 can be used in image smoothing when H C S. For example, an image X can 
be smoothed in a way as follows: let H be the set of pixels with states consisting of "sharp" or undesirable 
values, and replace those values by simulated ones, using Proposition l2.il H c is the portion of the picture 
that does not require smoothing. 

Remark 2.2. The special case H = S of Proposition 12.11 deserves particular attention. When H — S, 
A Sl = 0, the second term on the right hand side of ()2.3j) disappears since the summation is over zero terms. 
H(Xa b . — xa s .\X h c — x H c) in (|2.3|) should be replaced by XV(Xa s . = %A 3i ) because H c — S c = 0. 
Different from the case H C S in Remark 12.11 Proposition 12.11 with H = S is used to generate random 

N 

field X on the site space S: Tl(X = x) = J^J n(X Si = x Si \Xa 3 . = xa s . )■ 

i=l 

Remark 2.3. In Proposition 12.11 we assumed that tt s {x s ) > : Vx s G X s for each s € S. Note that X s 
can be different for each s S S. For given s, if there exists a x s £ X s such that it s (x s ) = 0, we may deem it 
uninteresting and replace X s with X s \ {x s }. Therefore the positive probability mass function assumption 
of 7r s is also a convention. But for tt s and tt s , we do not need to have this assumption, i.e. the support of 
tt s (x s ) or tt s can be a proper subset of X s . 

Remark 2.4. In general we will match covariances between a site s and sites in A c s ov = A s U {t : s G At]. 
This is one of the connected components in d(s). 

Remark 2.5. By definition, connectedness is based upon the notion of neighbor, and so is the Markov 
property of a Markov random field. The connectedness condition of the site space S guarantees that we 
can use the accumulated information about the neighbor states X ti , ti £ A Si , to generate the state X Si . 

A random field generated by Proposition 12.11 is a correlated random field. Indeed, one value of this 
proposition is the assertion that there are one-pass simulatable correlated random fields that match a 
given collection of marginal probabilities and covariances. What's more, we will state conditions in Section 
[5] that convert a correlated random field into a Markov random field. We call these one-pass simulatable 
correlated random fields generated by Proposition 12.11 the KNW Random Fields for ease of future 
reference. 



Remark 2.6. Proposition 12. II has a few interesting and important special cases: 
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(1) If Tt s (x s ) — tt s (x s ),x s £ X s for all s £ S, then (I2.3[) becomes 



U(X Sz = X Si \X As . = XA S . ) = 7T Si (x Si ) + I Yl ^Ui(x Uz )Y ^ 



«iG^4 s 



(2.4) 



where A Si = A Si U{ s i}- I n (|2.4|) . two auxiliary collections of pmfs are reduced to one. 
(2) If 7r s (x s ) = n s (x s ) — n s (x s ), x s € X s for all s£S, then (|2.3p is 



II(X S! = s Si |XA s . = a;^ s 



i+ ( n khm)- 



Ui£A s . 



(x Si - fJ, 7Vs . )p Si M (x ti - fl w ) 



(2.5) 



There are no auxiliary collections of pmfs in (|2.5I) . 
(3) If tt s (x s ) = 7r s (x s ) = -tf-,x a £ X s for all s £ S where d s is the cardinality of X s , then (|2.3p has the 
following form 

(x Si - p. gi )0 ai , ti (x u - p, ti ) 



U(X Si = x Si \X Aa . = xa h ) = n Si (x Si ) + ^2 



(2.6) 



, i s D A ei a H a U^( X A ei =XA Si \X H c =x H c) 

where fi a — — x s and a 2 = — (x s — p, s ) 2 for all s G 5 and Dj b = d Ui . Here 

s i,ex s s i,6X a uieA Si 

two collections of auxiliary pmfs take same discrete uniform pmfs respectively. The simplicity in 
(|2.6I) reduces the computation of conditional probabilities. Notice the p, a and a 2 calculations are 
simplified. 

(4) If tt s (x s ) — -j- and tt s (x s ) = tt s (x s ), Vx s £ X s for all s £ S with d s being the cardinality of X s , 
then ()2.3j) is changed to 



n ( X Si = x Si | X As . = x A ) = vr Si (x Si ) 



ti£A Si 



(x Si - n 7Tsi )l3 Su t i TTt i (x ti )(x ti - (1-k h ) 

D As i \{t i }<7%..<T% t .' n { X A. i = XA Si \X H c =X H C 



(2.7) 



where D As \{ t .} = ]Q d Ui . Here one collection of auxiliary pmfs take discrete uniform pmfs 

Ui£A Bi \{ti} 

and another collection is idcntitical to the prescribed {7r s }. 
(5) If we assume the sufficient condition for permutation property of a random field (see Corollary 
I5.2|) : tc s (x s ) = tt s (x s ),Vx s £ X s for all s £ S, then (|2.3|) becomes 



n( x Si = x Si | Xa 3 . = x Aa . ) 

7f Sj (a; Si )(a; Sj 
^ ) H 2 ~ 



al t Ii(X A ^ = x As . \X H c = x H c 



(6) Assume that the space of states X s for all s € S is same and is denoted by X. If tt s (x s ) — tt(x s ) 
and tt s (x s ) = %(x s ), x s £ X for all sites s £ S, then (|2.3[) is adapted to 



II( X Si = x Sz | X Ag . = 2U s . ) 

/ \ . ^(» Si )(^ Si - Ms) 

(a; Ss ) H 2 



E ( n *( x *. 
ueA ai \i&A Si \{ti} 



Psi,ti*( x u)( x ti ~ Mgj 

(T?n(XA 3i = X Asi \X H c = X H c 



(2.9) 



where /i# = 7r(x s )a; s and erf = 7r(a; s )(a;<; — /i#) 2 (s G 5). In this case, auxiliary pmfs {tt s } 

a?sGX a; s GX 

are identically distributed, so are {tt s }- 
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(7) If we combine the assumptions for formula (I2.6P and (|2.9p together, i.e., X s = X and tt s (x s ) = 
tt s (x s ) = 2> x s € X for all s g S where d is the cardinality of X, then (|2.3j) takes the following 
simple form 

n(JC 5i =x Si |X A =a; A )=7r 5i (a; Si )+ ^ .u. i+i/- 2mT rf'y iv T (2 ' 10) 

t ^W A "i\+'-(a 2 yil(X Asi =XA s .\X H c=x H o) 

where fi = — x s , a 2 — — (a; s — ft) 2 (s € S") and |j4 s J is the cardinality of A Si . Formula 

(|2.10l) is used in [10] for Captcha generation. 

Insomuch as the case H ^ S only involves a notational change, the remainder of this note will only 
consider the case H = S and will not state it anymore. For example, in the appendix we will only prove 
Proposition 12. II for the case H = S. 



Example 2.2. Let S = {1,2,3,4,5}, Sj = i for 1 < i < 5 and 9 contain the following subsets of S: 
9(1) = {2,4}, 9(2) = {1,3,5}, 9(3) = {2,4,5}, 9(4) = {1,3,5} and 9(5) = {2,3,4}. Then, 9 satisfies the 
conditions of a neighborhood system, i.e. (i) i ^ d(i) for i G S and (ii) i g 9(j) if and only if j € 9(i). The 
connected space (S, 9) is drawn as follows: each site (1 < i < 5) is represented by a node and each pair of 
neighbors are connected by a dashed edge. 



By previous discussion, we can take A 1 = 0, A 2 = 9(2)f|{l} = {1}, A 3 = d{3)f]{l,2} = {2}, 
Ai C 9(4) f|{l, 2, 3} = {1, 3}, A 5 = 9(5) f|{l, 2, 3, 4} = {2, 3, 4}. Now, {1, 3} is not connected because {1} 
is a proper subset and 9({1}) f]{l, 3} = 0. Hence, we choose A4 = {3} (We could have chosen A4 = {1} 
as well). Note that A$ = {2,3,4} is a connected set with respect to the neighborhood system 9. 

Let the common space of states for each site be X = { — 1,0, 1}. To construct a probability measure 
II on X = { — 1,0, l} 5 , we use Proposition 12.11 to compute the conditional probabilities: II(Xl = xi), 
IL(X 2 = x 2 \X% = xi), U(X 3 = x 3 \X 2 = x 2 ), II(Xt = X4\X 3 = X3), and U(X 5 = x 5 \Xa 5 = xa 5 ) where 
Xi e { — 1,0,1} for 1 < i < 5 and xa 5 G { — 1,0, l} 3 . To compute H(X^ = x&\Xa b = xa 5 ), we have to 
compute H(Xa 5 — xa 5 ) as a prerequisite, using Proposition 12. II By (12.11) and (|2.2p . first we compute the 
joint probabilities 

U(X 4 = Xi,X 3 = x 3l X 2 = x 2 ,Xi = xi) = n(X 4 = Xi\X 3 = a; 3 )II(X3 = x 3 \X 2 = x 2 ) 

xn(I 2 = x 2 \X 1 = x 1 )Ii(X 1 = xi), 

Since A 2 = {1}, A 3 = {2} and A 4 = {3}, we get 

U(Xa 5 = xa 5 ) = ^2 U{Xi=Xi,X 3 = x 3 ,X 2 =x 2 ,Xi=x\). 
xie{-i,o,i} 

Alternatively, with the connectedness of A5, we can treat A§ as S, use Proposition 12.11 again, and 
compute H(Xa 5 — %a & ) as follows: 

U(X A , = xa 5 ) = n(X 4 - x 4 \X 3 = x 3 )U(X 3 = x 3 \X 2 = x 2 )Tl(X 2 = x 2 ). 

This later alternative method is more efficient than the former one. □ 
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Remark 2.7. Example 12.21 shows how to use the connectedness assumption on A Si (2 < i < N) in 
Proposition 12.11 By Definition 12.51 A Si (2 < i < N) is chosen to be one of the connected components of 
d(si)f]{st,...,Si-i_}. For each site (1 < i < N), let A^° v = A S{ \J{sj :s,ei Sj ,l<j< N}. Herein, 
A c s ° v is the largest subset of d(si) containing A Si such that the covariances between X Si and X ti (U € A c s ° v ) 
are matched. For the choice A d (1 < i < 5) in ExampleES Af v = {2}, Af v = {1,3,5}, Af v = {2,4,5}, 
j^cov _ {3^5} anc ] Af v — {2,3,4}. In the following diagram, those neighbors with matched covariances 
are connected by thick edges, and the covariance between X\ and X4 are not matched: 




Remark 2.8. If S is connected and A Sl = d{si) f]{s u Si-i} for 2 < i < N, then A c s ° v = d{si), 
VI < i < N. Here A Si is the only connected component of d(si) f]{si, s;-i}. Herein we can prescribe 
{(3 s ,t '■ t £ d(s), s & S} such that the random field X constructed by Proposition 12 . 1 1 satisfies cov(X s , X t ) = 
f3 Sj t for any pair of neighbors s, t € S (without reference to the sets A s ). 

As introduced in Section [TJ random fields have typical applications in two-dimensional space (e.g. X- 
ray imaging) and three-dimensional space (e.g. human brain imaging and mammography). Herein, we 
present an example of two-dimensional site space and two types of its natural neighborhood systems. The 
three-dimensional analogue can be formulated accrodingly. 

Example 2.3. Let S be the space of pixels on an image of size M x N, that is to say, S — ■ 
1 < i < M, 1 < j < N}, where M, N e N. Recall the square distance p defined on M 2 : p{P,Q) = 
max(|x2 — xi\, j 2/2 — yi\) where P(xi, yi), Q(x2, 1/2) € M 2 . Fix I £ N, i is the radius of neighborhood system 
de on S defined as follows: for each (i, j) E S, 

d t ((i,j)) = {(«,«) £S:0< p((i,j),(u,v)) < £}. 

Therefore, (S,di) is a connected space. There are MN pixels in 5, and we enumerate these pixels in 
the following sequence: si = (1,1), s 2 = (2, l),...,s M = (A/, 1), s M +i = (l,2),s M+2 = (2, 2), s 2M = 
(M,2),...,s (J v-i)m+i = (l)-W)) s (N-i)M+2 = (2,N),...,s NM = {M,N), i.e., we list the pixels on the first 
column of the image in increasing order, then the second column until the last column. The (i,j) th pixel 
on the image is the ((j — 1)M + i) th element in the sequence, i.e., S(j_ 1 ) M+i = (i, j). For each (j, j) 6 5, 
let 

A \i,i) ~ d e((hj))f]{ s i, s 2,-;S(j-i) M +i-i} = de(s {j _ 1)M+l )f}{s 1 ,s 2 , ■-, s (j _i )M +i-i} ( 2 - u ) 

be the points in S within the square distance £ from that are either to the left or directly above of 
(i, j). The restriction to {s±, s 2 , S(j_i)M+i-i} ensures that only the previously simulated pixels that are 
to the left or directly above of (i, j) are used to generate the next pixel (i, j). One can verify that 

A\; Lj) = {(u,v) : i-(i-l)A£ <u< (i + l)A M, j — (J — 1) At <v< j-l}{J{(u, j) : i-{i-l)Al <u<i}. 

(2.12) 

It can also be shown that di((i,j)) C\{s±, s 2 , ■S(j_i)j\/ +J _i} contains only one connected component, i.e., 

Example 12.31 shows that the abstract connected site space (S, d) can have natural instances from real 
applications. The sequence of pixels enumerated above satisfies that Sk S <9({si, Sk-i}) for 2 < k < MN, 
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meanwhile {^( ij) }( 4 j)eS\{(i,i)} satisfy that A e (ij) = di((i,j))f]{sx,s 2 ,..;S ( j_ 1 ) M+i _ 1 }. Therefore, by 
RemarkESlA^ = ft((i,i)),V(i,i)G5. 

Kouritzin et al jlOj defined the same site space S as that in Example 12. 3[ but replaced the square 
distance with Euclidean distance and £ e R + is not necessarily an integer. When £ > the base 
set Ajj ^ can still be defined by (|2 . 1 1 1) albeit (|2 . 12[) docs not hold. Given £ > y/2, we still have A{f°? = 
d e ((i,j)), V(i,j) £ S. Nevertheless, when 1 < £ < y/2, d e ((i,j))f]{sx , S2, S(j-i)M+i-i} is not necessarily 
connected, i.e., Ai.* cannot be defined by (|2.11[) . 

Example 2.4. Let S = {i : 1 < i < 8} and s, = i for f < i < 8. The neighborhood of each consist of 4 
nodes which are connected to Sj by a dashed edge as illustrated by the following graph. 

<r T , 

3) r(r) 



4 Irvj^-'i 6 

5 



It can be verified that the neighborhood d(sj) of each Sj as a subset of S is connected. For example, 
d{s\) = 9(1) = {2, 3, 7, 8} is connected. But note that d(sr) f]{si, 1 < i < 6} = {1, 5, 6} is not connected. 
So we cannot define base set A S7 — {1, 5, 6}. 

Remark 2.9. Example 1 2 . 41 shows that the condition that the neighborhood of each site is connected cannot 
imply d(si)f~\{sj, 1 < j < i — 1} is connected for all 2 < i < N. In fact, for abstract space (S,d), it is 
difficult to find simple condition to guarantee that A Si = d(si) C\{sj, 1 < j < i — 1}, V2 < i < N. But 
fortunate enough, Example 12 . 31 illustrated that for many real random field applications, there exist natural 
neighborhood systems which have good properties and can satisfy our purpose very well. 

3. Novel Algorithm for Simulating Random Fields 

Let (S, d) be a connected space of sites, with valid setup {(sj, A Si )}fL 1 . For 1 < i < N, di 6 N is the 
cardinality of X Si , we denote X Si = {xl., ...,Xgf}. Based on Proposition 12. 1[ we have a novel algorithm 
for simulating X with state space X, given marginal probabilities {^si(x Si ) ■ x Si € 'K Si }fL 1 and given 
covariances of nearby sites {f3 Si ,ti ■ ti £ A Si }, for 2 < i < N. 

Do for i = 1,...,N: 
(1) Base on Definition 12.61 compute 

i-l 

R(X Si -i = x ai _ 1 ,...,X 31 =x Sl ) = ~[[u(X Sk =x Sk \X Sk _ 1 =x Sk _ 1 ,...,X 31 =x Sl ) 



fe=X 
i-l 



k=l 

for all chosen combinations of x si , x Si l . Here we choose x Sl , x Si-1 as follows: for each 
1 < k < i — 1, if Sfe G A Si , we use the simulated x Sk ; otherwise, we enumerate x Sk € X Sfc . 

(2) Take marginal to get H(Xa,. = xa b . )'■ 

u(Xa h = xa s .) = n^^.j =x s< _ 1 ,...,X Sl =x si ). 

*fc^A 3 . ,l<fc<i-l 

(3) Based on H(Xa s = Xa s . ), compute W{X Si = x{.\ Xa s . = xa s . ) for 1 < j < di, using 
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(4) Generate a [0, l]-uniform random variable U. For the given U, there exists unique 1 < j < di such 
that 

2-1 2 

^n(A s , =xi\x Asi =x Asi ) <u <^n(x Si =x u Si \x Asi =x A j. 

u—l u—1 

Then set X Si = x J s .. For notational convenience, we supress superscript j and use x Si to indicate 
the simulated value x 3 s . of X Si . 

Remark 3.1. There exists an alternative, more efficient way of computing H(X Aa = x As .): View A H as 
S, and let {(tj, B tj )}™Li be a valid setup for A Si , where rii is the number of sites in A Si . Now, compute 
H(X As . = x Ag .) as follows: 

n(x Asi = x Asi ) = u(x t] = Xtj \x Bt . = XBt] ). 

2=1 

In Example 12.21 we illustrated how to compute Ii(X A& — x A5 ) by treating A5 as 5. 

4. Summary of Applications of Novel Algorithm 

In this section, we summarize our prior applications of simulation algorithm of Section |3] to image 
generation, data authentication and target recognition. 

4.1. Application to Image Generation. Generating KNW-CAPTCHAs is an application of our simu- 
lation algorithm to image analysis. CAPTCHA is the acronym for "Completely Automated Public Turing 
test to tell Computers and Hummans Apart" (see [2]) and is widely used to prevent online resources intended 
for humans from abuse by automated agents. CAPTCHAs often appear to be images of characters or digits 
designed easy to read by humans and difficult to crack by computer programs. Kouritzin, Newton and Wu 
[TO] proposed a novel method for generating a type of CAPTCHAs named "KNW-CAPTCHAs" through 
random field simulation stated in Section [3] with common pixel state space X = {1, —1} = {black, white}. 
Roughly speaking, KNW-CAPTCHAs are generated by specifying proper pixel marginal probabilities and 
pixel-pixel covariances of alphabets and embedding these quantities into KNW conditional probabilities 
- formula (|2.10l) . One predominance of this method is that these random field CAPTCHAs can be sim- 
ulated in real time, yet another is significant resistance of these CAPTCHAs to attack. More details on 
this application can be found in [10] . With the algorithm in Section [3] and the alternative algorithm for 
generating Markov random field at the end of Section [5l we are also able to generate many variants of 
KNW-CAPTCHAs. 

4.2. Data Authentication Application. Data authentication is classifying data as true or fabricated 
and has been used in fraud detection and verification of data samples including financial data [7] . Detecting 
fake coin flip sequence was an application of our simulation algorithm to data authentication, see Kouritzin 
et al |8] where site space S = {1,2,..., N} consists of time units and coin state space is X = {1,-1} = 
{head, tail}. In [8], a filtering method was applied to simplified fraud-detection problem, classifying coin 
flip sequences as either "faked" (i.e., generated by a person) or "real" (i.e., generated by perfect flipping 
of a true coin). A true coin flip sequence has the expected head counts approximating | of total flips. For 
this reason, a faker that deviates from expected behaviour in one time period has to be compensated by 
later deviating from the expected behaviour in an opposite way such that deviant behaviour averages out. 
Consequently, marginal probabilities and pair- wise covariances between each flip and the flips that preceded 
in time were used to describe faker's behaviours. It followed that simulating fake coin flip sequences in 
real time became a fundamental step of solving the simplified fraud-detection problem. The much simpler 
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but preceded version of the algorithm in Section [3] was developed to simulate the faked coin flip sequences 
which are just specific random fields on the set S with state pace X = {head, tail}. 



4.3. Target Recognition Application. Analogue of simulation algorithm in Section[3]was also applied 
to target recognition by Kouritzin, Luo, Newton and Wu [9]. Kouritzin et al considered an imaginary 
detection problem of hidden targets such as rocket launchers in random forest. The rocket lauchers sit still 
in forest, and a surveillance aircraft or unmanned vehicle equipped with an electro-optic camera is flying 
over the forest to capture the forest with hiden target rocket launchers. The camera cannot penetrate 
through the foliage, and the foliage blocks the latent ground objects from detection. However, the camera 
can observe partial images of the ground through the gaps among the leaves. The forest is random in 
the sense that the foliage coverage is encoded with a binary representation (i.e., foliage and no foliage), 
and is correlated in adjacent regions. Similarly, the ground is a mixture of the grass and soil, and its 
color is randomly either green or brown with some type of correlation structure. The rocket launchers are 
camouflaged with the colors of grass (green) and soil (brown) and have another type of correlation structure. 
The information from within the forest called the observations, can be easily obtained from stored historical 
overhead pictures and analyzed pixel by pixel for each small area. As the captcha generation, simulation 
algorithm similar to that in Section [3] was used to generate the image of random forest, ground and 
hidden targets. The weight functions used by the SERP (selectively resampling particle) filter [3] resort to 
Proposition 2.7 of [9], which is a specific vector version of Proposition 12. 1 1 



5. Algorithm Properties and Constraints 

An effective algorithm that can generate random fields is related to two properties: regularity and 
consistency. Regularity means that the right hand side of (|2.3p is within [0,1]. Consistency means the 
permutation and marginality properties. In this section, we explore the constraints that can guarantee 
these properties. The outline of this section is as follows. First, we give the necessary and sufficient 
conditions of the regularity type for singleton case of A Si in Proposition l5.ll Second, we consider conditions 
for permutation and marginality properties for A C S where sites in A are neighbors of each other. For 
random field Xa, there are no conditions needed for the marginality property, and there exists a necessary 
and sufficient condition presented in Proposition 15.21 and 15.31 for the permutation property. Third, for an 
arbitrary site space (S,d), the neighborhood system d is extended to d' such that each pair of sites are 
neighbors of each other under d', i.e., for each site s £ S, let d'(s) — S\{s}. Furthermore, the covariances 
between each site s £ S and sites outside of d(s) are assumed to be zero. Based on these assumptions, 
we propose an alternative algorithm that can generate a true Markov random field with the remarkable 
property of consistency. 

We first determine the conditions of regularity type, i.e., the constraints for {f3 Si ,ti '■ U £ A Si }fL 1 of 
Proposition 12.11 that cause it to produce a value in [0,1]. A theoretical constraint relating a particular 
/3 Si ,ti to the given marginal probabilities {ir Si (x Si ) : x Si £ X Si } and {^(a;^) : Xt { £ X ti } is given in the 
following proposition. 
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Proposition 5.1. If S = {sj^ and A s = {t} is a singleton, then a necessary and sufficient condition on 
/3 s ,t for (|2.3j) to be in [0,1] is that f3 s ,t is a valid covariance; which corresponds to the condition 

-olvlKs{xs)Kt{xt) a cr I, cr I t ( 1 - 7r s (x s ))7r t (x t ) 



max 



A- 



^a.i.^i, \ v 7r s ( a; s)'''«( a; t)( a:; s - fii s )(xt - fii t ) ' X 7r s (x s )7rt(cc t )(x ;j - fli s ){x t - /U#t) 
j) t ex t .ii/ji it 



i s ex„i^ftf t \7r s (x s )7rt(xt)(x s - fii a ){x t - fii t ) 7r s (a; s )7ft(a; t )(a; ;s - fii s ){xt - M*J 

(5.1) 

Proof. For x s € X s , xt € Xt , we have by (|2.3j) that 
U(X S =x s \X t =x t ) G [0,1] 

of of t 7r s( af *) 7r *( :c t) + 7r s (a; s )7r t (x t )(x s - /i#J/3 Sit (x t - /i# t ) 
■£> — 5 2 — 2 7 — \ 6 i°> !J 



^ < al s o-l t TT s {x s )n t {x t ) + Ti s {x s )n t {x t )(x s - /i# s )/3 Sjt (a; t - /x# t ) < (r| s cr| t 7r t (x t ) 

4^ -CT? s cr~ t 7r ;j (a; s )7rt(a;t) < 7r s (a; s )7r t (a;t)(a; s — fii s )L3 s ,t(xt — MsJ < ^I, *^ (1 - ^(x^))^ (x t ) 

(5.2) 

By (j5.2[) , there is no constraint on /3 Sjf when x s — fii s or xt = /Li# t . Therefore, /3 Si t being consistent with a 
proper right hand side of (|2.3|) is equivalent to, 

o g - g I s ^I t ^(^)7Tt(^) a ^l^lS 1 ~ ^s(x s ))TT t (x t ) 

M |_7r ;s (x 5 .)7r t (x t )(x s - fii„){x t - /z #t ) ' ^ 7r s (x s )7r t (x t )(x s - /iff J (if - fli t )' 

-^toft^O^Mxt) . , g| 3 o-| t (l-7r s (x 8 ))7r t (xt) 

7T s (x s )7r t (x t )(x s - fli s )(x t - flit) n s (x s )TT t (x t )(x s - fli s )(x t - fli t ) 

for all x s G X s , x s ^ fii s , and xt G Xt, xt ^ /i# t . From (|5.3[) . (|5.1j) follows. 
In the Appendix, we will prove that 

rf fv v \ / \ / \ , 7Ts(a; s )(a; s - fn s )cov(X s , X t )itt(xt)(xt - fH t ) .s 

II(A S = x s ,X t = x t ) = 7r s (x s )7r t (x t ) H 2 — 2 ( 5 - 4 ) 

a H e a nt 

By multiplying both sides of (|5.4p by - ^fr^r^ ; for any x s G X s , x t G Xt, we can verify that 

(x s - /Lt#Jccw(X s ,X t )(a; t - fii t ) = - ' ^ — r[n(X s = x s ,Xt = x t )n(X s ^ x s ,X t ^ x t ) 

7r s (x s )7r t (xt) 



(5.3) 



-n(X s = x a ,X t ? x t )U{X s ^x Sl X t = x t )}. (5.5) 



It follows that 



o-% a a% t -K s {x s )-K t {x t ) (Ji o- 



< - - ,'■ *) M X s = x S) X t ^x t )Il{X s ^x s ,X t = x t ) 

7r s (x s )7r t (xt) 7T s (x s )7rt(Xt) 

< (x s - fii s )cov(X s ,X t )(x t - fli t ) ( 5 - 6 ) 

2 2 

< - ?*\** f M X a = x s ,X t = x t )IL(X s ? x s ,X t + x t ) 



< 



7f s (x s )7Tt(xt) 

a| s (T| t (l-7r s (x s ))7rt(xt) 



TTs{x s )^t{x t ) 

The inequalities in (|5.6p are actually tight. If for x s G X s and x t G X t , {X s — x s } — {X t — x t }, then 
{X s ^ x s } = {X t ^ x t }. It follows that 

{X s = x s ,X t = x t } = {X t = x t }, {X s ^ x s ,X t ^ x t } = {X s ^ x s }, 
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and 

Then, by (|5.5[) . we have 
(x s - ft*,)cov(X s , X t )(x t -Mff*) 



{X s = x s ,X t ^ x t } 



2 2 

°*'°* < r n(X t = a; t )n(X s ^i s )-0x n(X s ^ = art)] 



7r s (a; s )7ri(a; t ) 
al s al t (l - TT s (x s ))TT t (x t ) 
ir s (x s )Tt t (xt) 



(x 3 - na s )cov{X s ,X t )(xt - /z# t ). 



Similarly, if {X t ^ x t } = {A" s = x s }, we have 

7f«(a; 8 )7f t (xt) 

Therefore, by (|5.2p and (|5.6p . (15 . 1|) is exactly the constraint required for /3 St t to be a valid covariance. □ 

Secondly, we consider the case when A s is not a singleton. For this case, we assume that cov(X s , X t ) = j3 s 
for every i G A s , i.e., the state X s on the site s has same covariance with each state in A s . A simple necessary 
condition on j3 s is given in the following corollary. 

Corollary 5.1. Assume that cov(X s , X t ) = j3 s for each t € A s . Then, a necessary condition on /3 S for 
([23]) to be in [0,1] is 



-<rl a <># 7r s (x s )7r t (x t ) 



x.ex"^,,, V,7T s (a; s )7f t (a; t )(ar s - ^ s )(x t - /z #t ) ' \ 7r s (a; iS )7r t (a; t )(a; s - H%,){x t - /« # J 

a; t EX t .ii/jii t , 
*GA S 



-vlvlK S {x s )lT t {xt) 



w ^I^I^ 1 ^ 7r «(^)) 7r t(a ; t) 
!c s ex7^r^»„\ li 7r s (a: 8 )7f t (a;t)(x s - Ms-JO^t ~ M* t ) 7r s (a; s )7r t (a; t )(a; s - ^ s )(x t - n% t ) 

iiEX t ,i t ^)ii t , 
*GA S 



(5.7) 



Proof. For each i € A s , we get a condition on /3 S by applying Proposition l5 . 1 1 to v4 st = {t}. (|5.T[) follows by 
combining all these conditions into one. This is also the reason that (|5.7p is only necessary, not sufficient 
for (J231) to be in [0,1]. □ 

Assume that A — {si : 1 < i < n} C S where sites in A are neighbors of each other under <9, we denote 
the joint probability mass function of X Sl , ...,X Sn by II Sl Sre (a; Sl , ...,x s „) which by formula (]2.3|) satisfies: 



n~ 81 (^si j •*•) ^Sj) 



i-l i-1 



7r s . (x s .)(a; s . — /Ajf ) v-~* TT 
= n su ... iSz _ 1 (x Sl ,...,x St _ 1 )n Si {x Si ) + 2 _[_[ 7r Sfc (a; s J 

*»i i=i k=i,k?j 



(5.8) 



for i = n, n — 1, 3 and 



,s 2 (^Sl ) -^S2 ) — ^"S2 (*^S2 ) 7I 'si (*^Si ) "i" 



(5.9) 



We have the following lemma giving the closed form of U Slt ... )Sn (x sl , ...,x Sn ): 

Lemma 5.1. Based on the assumptions on A, the joint probability mass function of X Sl , ...,X Sn is 

Ilsi , . . . ,s n (,X Sl , • • • , X Sn ) 

'Tt Si (x Si )(x Si ~ /l# s .) ^ 



E 



e n 'Wkj 
j=i fc=i,fc/j 



11 7r Sfe (a; s J + J_J_7r Si (x Si ) 



of 



fc=i+l 



(5.10) 
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for each x Si € X Si (1 < i < n). 

Proof. Repeated use of (|5.8p followed by (|5.9|) yields 

f^-Si , . . . ,S n (*^Sl 3 •**) ) 

tt , \ / , , K s „{x Sn )(x Sn - /if ) ^ Pan^iKsjiXs^iXsj ~ far,.) 
= U su ..., Sn _ 1 {x Sl ,...,X Sn _ 1 )'K Sn {x s J + -5 2^ 11 ^(^J ^2 

"^-Sl ,...,S n _ 2 (*^Sl ) %S n — 2 )^S n _l (^5 n _l )^S n (^S n ) 

7r s „_i(a: s „_i)(a; a „-i -M*a„_ 1 ) ^ tt . , \ P* n -i,»jK»j( x *iK x »i ~ l**..,) , \ 

+ ^2 11 ^ ^ ) ^2 (^Sn ) 

J=l k=l,kj&j 

^s n {Xs n ){x Sn - /i* s )^ "inr 1 A , , ,Sj ^Sj ( x Sj ) (^Sj ~ Mtt s , ) 
+ -2 11 -2 

= 2^ I —2 2^ 11 ^(^J ~2 j - 11 ^{XsJ + l[ir Si {x Si ). 

(5.11) 

□ 

Remark 5.1. Our KNW quick simulation random field Xa on A satisfies the marginality condition 
defined in Definition 12.71 This can be proved by formula (|5.10p and the facts that ff Sj (x Si )(x Si — 

A** 3 .) = 0, Tr Si (x Si ) — 1, ir Si (x Si ) = 1 for 1 < i < n. Therefore, there are no derived 

conditions on /3 SijSj (1 < i =/= j < n) and 7r Si (1 < i < n) for Xa to satisfy the marginality condition. 

Next, we determine conditions of permutation type. Assume that s,t G S are neighbors of each other. 
We have by (|2.3j) and the multiplication rule that 

U(X S = x s \X t = x t )U(X t = xt) = U{X t = x t \X s = x s )U(X s = x s ), (5.12) 

for x s € X s , xt € X t . Hence, n tjS (a: t , x s ) = H St t{x s ,Xt), i.e., when we compute the joint probability of two 
simulated random variables X s , X t , the order of simulating X s and X t does not matter. It means that 
the specific form of (|2.3|) guarantees the permutation property of any two random variables with their sites 
being neighbors. This is a very desirable property of formula (|2.3|) when the site space 5* is one-dimensional, 
e.g., ScN. 

When there are n > 3 sites which are neighbors of each other, we have a necessary and sufficient 
condition for permutation property given by Propposition [5~2l below. First, we introduce some notations 
and preliminary results. For each 1 < i < n and 1 < m < di, let 

where a;"* € X Si = {x\. 1 ...,2;*}. In the proof of Propposition 15 . 2\ if we use x Si , omitting m for a;"?, then 
we do the same thing for i.e., we use z Si . For any duple 1 < i ^ j ' < n, let 



• (5-14) 



Note that z". ; is the weighted standardization of the element x™' by the weight 7r Si (cc"*) and p Si ,s 3 is 
an adjusted correlation coefficient of X Si and X s . . Without loss of generality, we assume that p Sl .s 2 



16 



BIAO WU, MICHAEL A. KOURITZIN*, AND F BASER NEWTON 



and p si ,s n 7^ 0, i- e -j Psi,s 2 7^ and p 31 , a „ 7^ 0. We introduce d (= 5Z™ =1 di) variables y^ and denote d 
components 



(5.15) 



for 1 < ttj < c?i, 1 < 2 < n. 

For each triple 1 < k < n (i,j and k are distinct integers) and 1 < M,- < di, 1 < ttj < dj and 
1 < Wfc < dfc, we formulate equations in y as follows: 



;u« ZUj ~ U k _ ~Ui ~u k ~ u 3 _ ~u 3 ~u k ~ Ui 

^sj Hsi-sj y Sk — * Si 4 Sk i-'si,s k y Bj — *- Sj ^ Sfc l J sj,s k y Si 
(|5.16p is equivalent to the linear equations of Blockwise Canonical form: 



(5.16) 



z Sj Psj,s k y Si 

ZU k ~ Ui 

s k l J sj,s k y Si 



z si Psi,s k y Sj 

zu, ~ u k 

z si l J si,s j y Sk ■ 



(5.17) 



The coefficient matrix of the equations (|5 . 1 7[) is 



Uj = 1 



u j = d j 



u k = 1 



Uk = d k 



Psj,s k z\ 



Psj,S k Z Sj 



P S 3' S k Z l k 



Psj,s k z s k 



Psj,s k zl. 



P Sj ,s k Z S j 



Psj,S k 



si 



P s j> s k z s k 



Psj,s k z l 3 



Ps j ,s k z s j 



Psj-s k z l k 



Ps i ,s k z \ i 
~2 

~Psi,s k z Si 



-ps„s k Zs' 



Ps 



Psi,s k z Si 

~ ~2 
~Psi,s k Z s 



Psi,s k z sl 



-Ps^s 3 Z 2 Sz 



■si -Ps, 

(5.18) 

for each given triple i,j and k. 

In Section [2 M n is introduced as a set containing n arbitrary elements. Hereafter, we let M n = 
{1, 2, • • • , n}. Denote the group operation of Gm„ composition by o, and denote the identity permutation 
of M n by e, e.g. e(i) = i, Vi € M n . The generators of Gm„ written in cyclic form are (i i+l), Vi € M n \{n}. 



Proposition 5.2. Assume n > 3 and A = {s; : 1 < i < n} C 5 with all sites in A being neighbors 
of each other under d. Then, a necessary and sufficient condition for the permutation property of the 
joint distribution of X Sl , ...,X Sn , i.e. II sl) ... )Sn (x Sl , ...,x Sn ) is that (|5. 15[) is a solution of linear equations of 
blockwise Canonical form (|5.17l) . 
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Proof. For a given permutation g G Gm„ , similar to (|5.10l) . the joint pmf of X s ,^ , X s ,< is 



i-l i-1 



i=2 L ^ j=l k=l,k^j ' fc=t+l 

to 

+n^(.)K w ) ( 5 - 19 ) 

for each a; Si € X Si (1 < i < n). A necessary and sufficient condition for the permutation property of the 
joint distribution of X Sl , X Sn is that for any pair of permutations g,h G Gm„ , 

ns g (i),...,s g( „) ( x s g(1) > •••) x s s („) ) — n Sfl(1) , ...,* h ( n ) ( a; s fe(1) , a^s h( „) )■ (5.20) 

First, we prove necessity: For any g € Gm„, there exists ft G Gu n satisfying ft(2) = g(3), ft(3) = g(2) and 
ft(i) = g{i), V« G M n \ {2, 3} (We can also choose ft G G A /„ such that ft(l) = g(3), ft(2) = g(l), ft(3) = g{2) 
and h(i) = g(i), Vi G M n \ {1, 2, 3}). Then it follows by (jS~ll)|) and (j5~2Hj) that 

,Sg(2) ^Sg(l) ^ S S(2) (^sO) ( Xs g(3) ) _ ^ s s(3) ( Xs g(3) )) = ^ s g(l) > s g(3) ^ s g(l) ^ s g(3) ( 7Fs g(2) ( 2 ' s g(2) ) _ ^ s g(2) (' Es g(2) ))' 

(5.21) 

Similarly, for the same g, there exists ft G Gm„ satisfying ft(l) = g(3), ft(3) = g(l) and h(i) — g(i), 
Vi G M n \ {1, 3} (Another choice of ft G G Mn is that ft(l) = g(2), ft(2) = ,g(3), ft(3) = g(l) and ft(i) = g(i), 
Vi G M„ \ {1, 2, 3}). Then we have that 

Ps S (l),Sg(2)Zs s(1) Zs g(2) ( 7r S 9(3) (^S a (3) ) _ ^Sg(3) ( X S g (3) )) = P«g(2) ,« fl (3) ^g(2) ^S g(3) ( 7r S g(1) (^(1) ) — ^S g(1) (^(y ))• 

(5.22) 

Since g is arbitrary, it follows by equalities (I5.2ip and (|5 -22[) that 

Ps t , Sj 5 Sz Z S] (ir Sk (x Sk ) - 7T Sk (x Sk )) = p s ^ Sk S Sz S Sk (TT S] (x S] ) - n S] (x S] )) = p s ^ Sk Z S] z Sk (-n Sx (x Si ) - 7T S( (x Si )) 

(5.23) 

for all x Si G X Si , x Sj G X s ,, G X Sfc and distinct triple 1 < k < n. (I5.23P is equivalent to (|5.15p 
being a solution of linear equations of blockwise Canonical form (I5.17[) . 

Now we prove sufficiency: If (|5.15[) is a solution of linear equations of blockwise Canonical form (|5.17l) . 
then (|5.23l) follows. Next, we prove that for each g G Gm„, Vi G M n \ {n}, we have 

n Sg(1)i ...,s s(n) ( x s g(1) , ■■; x s g(n) ) — II Sgo(i i+1)(1) ,...,s go(i i+ i)(„) ( x s go(i i+ i) ( i) ! •••) x s go(i «+!)(„))• (5-24) 

By using (|5.19l) . we can verify (|5.24[) holds for i = 1 without any other condition. For fixed 2 < i < n — 1, 
(I5.24p is equivalent to 

^»sW »j(i-l)i's(«)i»s«+l)i-i»iW ( Xs g(i) ' ^Sg^-l) ' Xs g(*> ' X * 9 (i+l) ' — ' X *sC») ) 

= n Sg(1)i ... iSg( ._ 1)iSg( . + 1) ,S g(i) ,...,S g( „ ) (^S a (l) ) •••) ^(i-l) ) X Sg( I + l) > X »g{i) ' X * S (n))' (5.25) 

which though (|5.19l) is guaranteed by 

Ps g u),s g{i) z Sg{]) z Sg(t) (ir Sg( . +1) (x Sg(i+1) )— vr Sg(i+1) (a; Sg(4+1) )) = p Sg(3) , Sg(i+1) z Sg0) ^s g(l+1) ( 7r s g(i ) \ x s gW )—^s g{i) ( x s g(i) )), 

(5.26) 

VI < j < i - 1. For each 1 < J < i - 1, 2<i<n-landg€ G A/ „, (|5"^d| follows by (1031) . Therefore, 
(|5.24p is proved. Let g = e, then (I5.24p implies that for each generator (i i + l)(l<i<n — 1), 

n si ,..., s „ (x si , x Sn ) — n S( . ,...,s (i i+ i)(„) ( a; s( i i+i)<i) i •••! i+ i)(„) )• (5-27) 

(|5.24p and (|5.2T[) together lead to the permutation property of the joint distribution of X Sl , ...,X Sn . □ 
Before we discuss the solution space of the linear equations of blockwise Canonical form (|5.17l) , we point 
out its trivial solution where is the d-mensional zero vector. This implies a simple albeit special 
condition on the permutation property of X S1 , X Sn . 
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Corollary 5.2. Assume n > 3 and A = {si : 1 < i < n} C S with all sites in A being neighbors of 
each other under d. Then, a sufficient condition for the permutation property of the joint distribution of 
X Sl , X Sn is that TT Si (-) = 7r Si (') for 1 < i < n. 

Proof. For each 1 < i < n and 1 < M, < di, 7r S( (") = i"si(") implies that y" 4 = 7t Si (%s*) ~ 7T s i (x" i ) = 0, 
i.e. (|5.15|) gives the solution of (|5.17j) . The conclusion of this corollary follows by the sufficiency of 
Proposition E2I □ 

We make a very loose assumption that no distribution in {7r s } s6 5, {tt s } s ^s and {n s }s<ES is degenerate, 
i.e., being the distribution of a constant random variable. Now we discuss the structure of the solution 
space of the linear equations of (|5.17|) . The solution space of (|5.17|) is the null space of the canonical form 
matrix in ([5TT8]) . This space depends on the value of n and the correlation structure among X S1 , ...,X Sn , 
i.e. p Si , Sj (1 < i < j < n). We deal with the case n = 3 and the case n > 4 separately. 

When n — 3, i,j and k essentially have only one combination, e.g., i = l,j = 2 and k = 3. Based on 
the assumption above, there exists at least one 1 < Ui < d% such that z^ 1 ^ 0. It follows that (|5.17j) has 
solutions: 



y u sl 



~Ml,,Ml 



KS1,S2-'S1 



VI < Ui < di 

VI < 1*2 < C?2 

VI < M3 < e?3 



(5.28) 



where y* 1 is a free variable. It means that the solution space of (|5.1T[) is one-dimensional. 

When n > 4, it becomes more complicated. First, we define the correlation multiplication equalities. 

Definition 5.1. Correlation multiplication equalities are 

Psi,SjPs k .Si — Psi,SiPsj,S k = Psi,S k Psj ,S( (5.29) 

for each quadruple 1 < i, j, fc, 2 < n where i, j, fe and 2 are distinct integers. 

In Example l5.11 we will construct a random field whose nearby correlations indeed satisfy the correlation 
multiplication equalities f|5 -29[) . 

For each quadruple 1 < i,j,k,l < n with i,j,k,l being distinct, we may consider the triples (i,j,k), 
(«,j, /), (i,k,l) and (j,k,l) and apply the n = 3 case to each of them. Each of the four groups of equa- 
tions of blockwise canonical form has solutions formally similar to (|5.28j> . that is to say, we can solve 

/..Ui u j u k \{dz,dj,d k ) r u . Uj u , (di,dj,di) r u< Uk Ul \(di4k4l) j 

\y Si ' ys 3 j </ Sfc j( Tt4 ,u i , Uj fc)=(i,i,i)' ti/ Sl j i/sj > s/ Si j(« < ,« 3 -,u I )=(i,i,i)' ti/ s , > i/ Sfc > i/ Si j(« < ,« fe ,« l )=(i,i,i) dIlu 

{y*/>y**>y*/}(u 3 -'«k'«V)=(ii l) separately. Then same variables, e.g. {y^*}* = i have solutions to three 
group of equations with unknown variables y u s ] y^ }f.;J' i t*)= ( i,i,i). {^Sy"/^C}£'i',S=(i,i,i)' 
and {ys*,Vs£ )y«/}fo.'„*'„,wi 1 1) respectively. It follows that (|5.29l) is a sufficient and necessary condition 

f° r {2/"/}*=!' {Vsj }u=v {ysfc}«fc=i an d {2/s ( ! }u(=i have coincident solutions to equations of blockwise 
canonical form involved. Again, we assume that z™ 1 7^ for some 1 < u\ < d\. Then we can verify that 
the correlation multiplication equalities are equivalent to that (j5.I7[) has solutions: 



VI < u\ < d\ 



P~s 2 ,s n z s ™y s * 



VI < Ui < d i: 2 < i < n- 1 



VI < u n < d n 



(5.30) 
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where y^ 1 is a free variable. It means that for the case n > 4, when the conditions (|5.29p are satisfied, the 
solution space of (|5.17j) is also one-dimensional. 

To summarize, when n — 3, (|5.17|) has nonzero solutions expressed by f|5 . 28|) . When n > 4, (|5.17[) also 
has nonzero solutions formulated in (|5.30p . given the correlation multiplication equalities conditions 
(|5.29p are satisfied. The conditions (|5.29j) themselves are not simple though. We need to further our 
investigation on the conditions that can guarantee the existence of (|5.29|) . 

Lemma 5.2. Assume n > 4, p Sl . S2 7^ and p Sl ,s n 7^ 0. The correlation multiplication equalities 
conditions (|5.29[) are satisfied if and only if 

p = P'^Psus* V3<fc<n-1 



Psi, 

Psi,SiPs 2 ,S rl Psi,Sj 

Psi,s 2 Psi ,s n 



V3<i^j<n-l (5.31) 



Psi,S2 

where p Sl . Sk (2 < k < n) and p S2 ,a n are n free variables. 

Proof. In (|5.29p . there are (™) many ps and 2(") equations. One can easily verify that (|5.31[) solves (|5.29l) . 
On the contrary, for given n free variables p Sl ,s k (2 < k < n) and p S2 ,s n , we solve (|5.29j) and obtain its only 
solution (|5.31[) as follows. For 3 < k < n — 1, apply ()5.29j) to 1,2, A;, n, then p Sl ,s 2 Ps k ,s n — Psi,s n Ps 2 ,a k = 
Ps 1 .s k Ps 2 .s„ follows, which results in the first and third equalities of (15 .3 1[) . For 3 < i j < n— 1, we apply 
(|5.29|) to l,2,i,j. As a result, p Sl ,s 2 Ps i ,s j = Ps 1 ,s i Ps 2 ,s j holds. Then we plug in the first equality of (|5.31j) 
and the second equality of (|5.31|) follows. □ 
Proposition 15.31 follows by Proposition [5T2] (|5.28[) and (|5.30l) . and Lemma [S~2l 



Proposition 5.3. Assume that {si, s„} are n > 3 sites of (S, d) which are neighbors of each other. 
Assume for n > 3, p Sl .s 2 0, p sllS „ 7^ 0, 5"* ^ for some 1 < ui < d\ and for n > 4, (|5.31[) holds. Then, 
a necessary and sufficient condition for the permutation property of the joint distribution of X Sl , ...,X Sn 
is the solutions of (|5.17j) . i.e. (1 < Ui < rfj, 1 < i < n) given by (|5.15|) . have closed form as follows: 



Z S1 US! 



y u sl = ^ 1 « 1 )-^ 1 « 1 ) = ^ 1 - vi< Ul <d! 

Vll = fr.,(aff ) - K Si (aff ) = Hs :' Sn s i™ x \H<Ui<d u 2<i<n-l (5.32) 

VZ = *. n (s%) - n Sn (xZ) = ^f^:!} 1 Vl<u n <d n . 

Psi,s 2 Zsi 

We give an example which illustrates the application of Proposition 15. 3[ Corollary 15.21 Lemma I5TT1 and 
Corollary 15. II as follows. 

Example 5.1. Let S = {1,2,3,4}, Sj = i for 1 < i < 4 and d contain the following subsets of S: 
8(1) = {2, 3, 4}, d(2) = {1,3, 4}, 9(3) = {1,2, 4}, 9(4) = {1, 2, 3}. Under this d, all sites in S are neighbors 
of each other. 

Let the common space of states for each site 1 < i < 4 be X = {1, —1}; ^i(xi) = \ Vx^ € X; 7^(1) = p 
and 7Ti(— 1) = 1 — p; 7Tj(l) = p and 7Tj(— 1) = l—p where < p,p < 1. It follows that p,^ i = and erf. = 1. 
For convenience, let x\ = 1 and x\ = -1 (1 < i < 4). Then by (|5.13j) , z\ = \x\ for 1 < % < 4, k = 1,2. 
Let P12 — /3%3 = /?23 = Pu = /?24 = /?34 = /3, i.e., all covariances between nearby sites are the same, then 
P12 = P13 = P23 — Pxa = P24 = p34 = P, and the Correlation multiplication equalities (|5.29[) hold. 
By (|5.14p . pi2 = pi3 = P23 = Pii = P2i = Pn = P = P for an y duple 1 < i ^ j < 4. Consequently, all 
coefficients of linear equations (|5.17p arc determined. 
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If p = p, the sufficient condition Tti(-) = 7r,(-) (1 < i < 4) in Corollary 15.21 is satisfied, then the 
permutation property of IIi^, 3,4(^1, x 2 , x 3 , x 4 ) follows; otherwise, p 7^ p and it follows from Proposition 
15.31 that ni j2 , 3,4(^1, x 2 , x 3 , x 4 ) is still permutable because the solution of (|5.17[) given by (|5.15[) is 

y\ = p — p VI < i < 4 

Vi=p — p VI < « < 4, 

which exactly takes the closed form given by (|5.33[) . The permutable joint pmf of X\, X 4 follows directly 
from ()5.10p as 

111,2,3,4(^1, x 2l x 3 , x 4 ) (5.33) 
4 ! 

= J|7ri(xi) + -^[iri^^^^^) + x 1 x 3 tt 2 (x2)^4:(x4) + x 2 x 3 TT 1 (xi)ir 4 (x 4: ) 
1=1 

+x 1 x 4 n 2 (x 2 )n 3 (x 3 ) + x^^^x^^x^ + l3lA(ii)f 2 (i2)] 

for each 6 X (1 < j < 4). 

The condition on j3 such that the right hand side of (|5 . 33[) is within [0,1] can be given in explicit 
inequalities of (3. For illustration purpose, we only consider the special case of p = p = 5. For this case, the 
necessary condition obtained from Corollary lem:jointpmf is /3 G [—1,1]. One can verify that the necessary 
and sufficient condition is actually (3 G [— g, |]- 

By formula (|5.10p . for every pair 1 < i ^ j • < n, if p Si , Sj — 0, X Si and X Sj are independent. More 
generally, we have the following corollary about the independence of state on a site with uncorrelated 
states on its neighbors: 

Corollary 5.3. Suppose 7r Si (-) = 7r Si (-) (1 < i < n). If for some 1 < i < n, X Si is uncorrelated with rest 
X Sj (1 < i 7^ i < n), then X Si is independent of X Sj (1 < j ^ i < n). 

Proof. By Corollary 15.21 we only need to prove the statement if X Sn is uncorrelated with each X Sj (l < 
j < n — 1), then X Sn is independent of {X Sj }™~^. This statement follows immediately by using formula 
(|5.10p for the cases n and n — 1 respectively. □ 

Remark 5.2. Assume n Si (-) = Tr Si (-) (1 < i < n). Even if (X Sl , X S2 ) T and (X S3 ,X S4 ) T are uncorrelated, 
(X Sl , X S2 ) T and (X S3 ,X S4 ) T are not necessarily independent. But Corollary 15.31 does provide us a way to 
producing a true Markov random field with the remarkable properties of marginality and permutation. 
Recall the concept of connected space (5,9) in Section [5] We extend the neighborhood system d to d' as 
follows: for each site s e S, let d'(s) = S \ {s} and p„ tt = 0, Vi E d'(s) \ d(s) = S\ (d(s) \J{s}). What's 
more, for each site s € S, because X s is uncorrelated with {X t :t G S\ {d(s) 1J{ S })}: Xa iS independent of 
{X t : t G S \ (d(s) {J{s})} by Corollary 15.31 Now, let's return to the original neighborhood system d. The 
previous conclusion infers that the state of each s G S depends only on states of its neighbors t G <9(s), 
regardless of rest of the states. This exactly matches the definition of Markov random field in Definition 

m 

Before we state an alternative algorithm to generate our KNW Markov random field with those good 
properties of marginality and permutation, we update formula (|2.3j) with the sufficient condition for per- 
mutation, i.e., 7T S (') = 7Ts(-) (s G S). For any ordered sequence {si}fL 1 of sites in S, it follows by formula 
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and Corollary 15.31 that 

U(X Si =x ai \X Si _ 1 =x ai _ 1 ,...,X ai =x Sl ) 
= Il(X Si = x Si \X As . = x Asi ) (5.34) 
, > n Si (x Si ){x Si - fa ) / \ f3 SiM Tr u (x ti )(x ti ~ ^ t .) 

= + ^ L { II ■ a2 ( = } , 

where A Si = <3(sj) f|{ s i: — i Si-i}- 

Now we propose the alternative algorithm to generate the KNW Markov random field with the desired 
marginality and permutation properties on S: 

Do for i = l,...,N: 

(1) Choose arbitrary site s £ S\{s±, s^-i} and denote it by S{. Then, let A Si = d(si) f]{s%, Sj-i}, 
and compute H(Xa 3 = xa 3 .) by treating A Si as S and recursively repeating all the steps listed 
here. 

(2) Based on H(Xa„. = xa s . ), we compute II(X Sj = | Xa s . — x As . ) for 1 < < <i, using (|5 .34[) . 

(3) Generate a [0, l]-uniform random variable U. If 

^ n(x Sj = x^i | = .t Asi )<u< Y,n(x Si =x%\ x As% = x Aa . ) 

Ui — l Ui — 1 

for some 1 < j < di, then set X Si — x° s . . We use x Si to indicate the simulated value x J s . from X Si . 

Remark 5.3. Note that in the generation of random field X$ on (S,d), we made the extra assumption 
that for each site s £ S, /3 s , t = 0, Vt £ S \ (d(s) {J{s}). The gain is the simplicity of the new algorithm 
compared to the algorithm in Section [3] and the desirable properties of marginality and permutation. This 
extra condition is actually necessary for generating a Markov random field, since state of each site on a 
Markov random field has to be independent of states outside of its neighborhood so as to be uncorrclatcd 
with them. Note also that S = {si}fL 1 is not prescribed, but dynamically chosen. Here we took the great 
advantage of permutation which guarantees that our simulated Markov random field does not depend 
on how we order sites in S. This alternative algorithm behaves in Gibbs sampler manner because every 
time it simulates state on a new site, it uses as much information as possible from simulated states in 
its neighborhood. But it is not Gibbs sampler algorithm, because this algorithm also simulates a Markov 
random field within one-pass. 

Note also that in the above algorithm, H(Xa b . — %A„.) is computed through treating A Si as S and 
recursively repeating all the steps in the algorithm, not through the multiplication rule employed in the 
algorithm of Section [3l The reason is that the sites in A Si may not be neighbors of each other, if we 
use the multiplication rule, the Markov property to be established will be broken. Since there is no such 
concern as the Makrov property for the algorithm in Section |3l multiplication rule can be applied there. 
Gcnearally speaking, S is big and neighborhoods are small, the method of computing TI(Xa s . = xa s .) in 
this alternative algorithm is much more efficient than that using multiplication rule. 

6. The Appendix 

Proof of Lemma \2.1\ First, we prove that we can enumerate the sites within H in a sequence {si}™ =1 such 
that Si £ d(H (J{si, Sj_i}) for 1 < i < n. Because H is nonempty, H c is a proper subset of S. If 
H = S, d(H c ) — 9(0) = S, then we pick up arbitrary si £ d{H c ) — S. Otherwise, H is a proper subset 
of S, so H 7^ 0. Because S is connected, d(H c ) ^ 0, we choose arbitrary s\ £ d(H c ). Assume that 
for some 1 < j < n we have {si}j =1 such that Si £ d(H c {J{s±, ...,s^_i}) for i = 1, Then, we have 
Si ^ H |J{si, Sj_i} for i — which implies that Sj (i = are j different sites within H. 
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Because 1 < j < n and H consists of n sites, H c (J{si, Sj} is a nonempty proper subset of S. Since S is 
connected, d(H c U{si, Sj}) ^ 0, we can find some arbitrary Sj + i € d(H c lj{si, sj}). By induction, 
the statement is proved. 

By the above construction, for each 1 < i < n, Si G d(H c lj{si, Si~i}), it follows that 
d(s i )f][H c [j{si,...,s i - 1 }} ^ except when i = 1 and H = S. If H = S, then d(s t ) f][H c \J{ Sl , s;_i}] = 

0, we let mi = 1 and B s \ = such that d(si) f][H c |J{si, s 4 -i}] = |J B{., We also let A Sl = B^. To 

3=1 

prove the general case, it is sufficient for our purpose to prove for any nonempty A a S, there exist m > 1 

m 

and exclusive subsets {B : '}JL 1 such that A = B^ and each B^ (1 < j < m) is connected. 

3 = 1 

For a given nonempty A C S*, we now present one way of choosing a nonempty connected component 
£? C A. We start £? with the empty set (B = 0). We select an arbitrary site t\ € A, and append ii 
to B (B = {ti}). If <9(i?) P| A = 0, that is to say, B has no neighbors within A, then we stop and B is 
finalized (B = {ii}). If d{B) f| A ^ 0, then we pick up a site £ 2 € d(B) f| A, a neighbor of B within A, 
and append ti to B (B = {ii^})- If <9(£?) p|^4 = 0j we stop and -B is finalized (B = {<i,t 2 })- We repeat 
this procedure and add more and more neighbors of B within A to B. Because A is finite, the procedure 
must be stopped for some 1 < k < \A\ where \A\ is the size of A, then B is finalized (B = {t\,t2, •■•,£&})■ 
By the construction of B, tj € £?({ii, for 2 < j < fc, so B is nonempty and connected. 

If £> = A, the claim in the second paragraph is proved. Otherwise, let B 1 = B. Then A \ B 1 ^ 0, 
B 1 and A \ B 1 are exclusive. Then we can replace A in the third paragraph with A \ B 1 and repeat the 
procedure there. Since the orginal A only has finite sites, there exist some m > 1 for A, such that we 
can repeat the procedure in the third paragraph m times to find out those m connected components of A 

m 

denoted by {B : '}JL 1 such that A = B J and {B^yjL^ are mutually exclusive. For the given nonempty 

3=1 

A C S, once its nonempty exclusive connected components {B J }JL 1 are determined, we can always choose 
among them a component which contains the largest number of sites. 

Henceforth, for each nonempty d(si) f)[H lj{ s i> •••) s i-i}L we nrs t its exclusive connected components 

mi 

{Bl.}™i x such that d{s l )f][H c [j{ Sl , s^}} = [j B j Si and then choose among {B^}™^ one of the 

3=1 

largest (meaning the number of sites) and denote it by A Si . □ 



Proof of Proposition POl It is clear by (|2.3|) that for i = 1, X Sl has probability distribution 7r Sl (-), since 
A S1 = 0. 

We next to prove that for 2 < i < N, X Si has probability distribution 7r Si (-), and for any ti <E A Si , 
cov(X Si , X ti ) — /3 Si ,ti- To ease notation, we suppress the subscript i. For x s € X s , by (12.31) . one has that 



u(x s = Xs ) = E n(x s = x s \X As = x As )U(X As = x As ) 

x As GX As 

/ \ i n s (x s )(x s - n is ) \ ^ ST ( n - / \ Ps,tKt(xt){xt - At-sO 



^ , , , j. ^ 2 

7T s (a; s )(x s - /ijfj ^ ^ / y-r ^ 7T t (z t ) (x t - fj^ t ) 



tt s (a; s ), 



< , , , s , , 2 
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where the third equality follows from the equality ^2 XA e x A = ^x A \ {t} ex A t X/£c t ex t an< ^ interchanging 
the order of summations, and the fourth equality holds, because for fixed t € A a , 

Now fix i € ^4 S , we prove cov(X Sl Xt) = (3 s ,t- We compute the joint probability mass function of X s and 
X t . For x s E X s ,a; t 6 X t , we have that by (|2 . 3[) again 



n(Jf,=a:. > Jr t =xt) 

^ II(X S = af a |X t = x t ,X As \ {t} = x As \ {t} ) x n(X t = x t ,X As \ {t] = x As \ {t] ) 

x A s \{t}£X.A a \{t} 

TT s (x s ) Y K( x t = %U X A s \{t}=ZA B \{t}) 

x A s \{t}£ X A s \{t} 

, TTs^sX^s — MS. ) ( TT » / \ Ps,u^u{Xu){x u — flfi ) 
+ —2 2^ (11 

7r " x A„\{t}^ :x -As\{t} u ^ A s \eAs\{u} 



since Ii{X t = Xt,X A \r t \ = x As \s t \) = Tl(X As = x As ) for t 6 A s . Therefore, 



a? 



(6.2) 



n(X s = ^,X t = a; t ) 
7r s (a; s )7r t (a; t ) H » 



E 



H 7To(a;i; 
ueA,\{t} 



Ps,tTTt{x t ){x t - flz t ) 



+ e ( n 2- 



7r s (a; s )(2: s - /U #s ) 



e E n 



7r s (x s )7r t (x 4 ) + 



7r s (a: s )(a: a - ^ s )fi s ^t{x t ){x t - fJL% t ) 



~2 ^2 

%8 7tt 



(6.3) 



where the third equality follows from arguments similar to those in (|6.ip , in particular: 



e e ( n ^(^) 



E a.. E 



7Tt(a;t) • Yl n v (x v ) 
veA s \{t,u} 



a? 



^Oct) E 1 [ ( E ^(^-f) 

nt(xt) /3 s ,m • 

u£A s \{t} 

0. 



i„ex„ 



<7? 



(6.4) 
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Therefore, by (16.31) . we obtain, 

E[(X S - fi»„)(X t - Mtt*)] = (x s - f-ia s )(x t - M7rj7r a (zg)7Tt(xt) 

i,ex, x t ex t 

. Ex s gX s Ex t £X t ^OOfo - &r.) 2 h( x t)( x t - ^tYPs.t 



^2 2 

= (£[X s ]-/i*J(£[X t ]-/i fft ) + &,t. 

(6.5) 

It follows that 

cow(X s ,X t ) = cow(X s - ft^ s ,X t - (Lt# t ) = /3 s , t (6.6) 
□ □ 
Remark 6.1. Note that ([ST3|) follows by and ([63]) . 
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